SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: OLSEN, HENRIK S.
RUBEN, STEVEN
NI, JIAN
MURPHY, MARIANNE
GENTZ, REINER

(ii) TITLE OF INVENTION: FC RECEPTORS AND POLYPEPTIDES

(iii) NUMBER OF SEQUENCES: 32

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: HUMAN GENOME SCIENCES, INC.

(B) STREET: 9410 KEY WEST AVENUE

(C) CITY: ROCKVILLE

(D) STATE: MD

(E) COUNTRY: US

(F) ZIP: 20850

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: US

(B) FILING DATE:

(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: BROOKES, ALLAN A.

(B) REGISTRATION NUMBER: 36,373

(C) REFERENCE/DOCKET NUMBER: PF363

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: (301) 309-8504

(B) TELEFAX: (301) 309-8512

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1552 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 82..1362

(ix) FEATURE:

(A) NAME/KEY: sig_peptide

(B) LOCATION: 82..142

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 145..1362

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

GCAGGAATTC GGCACGAGCC TCTGTCCTGC CAGCACCGAG GGCTCATCCA TCCACAGAGC 60 

AGTGCAGTGG GAGGAGACGC C ATG ACC CCC ATC CTC ACG GTC CTG ATC TGT 111 

Met Thr Pro Ile Leu Thr Val Leu Ile Cys
-21 -20 -15

CTC GGG CCC CTC CCC AAG CCC ACC CTC TGG GCT GAG CCA GGC TCT GTG 159
Leu Gly Pro Leu Pro Lys Pro Thr Leu Trp Ala Glu Pro Gly Ser Val
-10 -5 1 5

ATC ACC CAA GGG AGT CCT GTG ACC CTC AGG TGT CAG GGG AGC CTG GAG 207
Ile Thr Gln Gly Ser Pro Val Thr Leu Arg Cys Gln Gly Ser Leu Glu
10 15 20

ACG CAG GAG TAC CAT CTA TAT AGA GAA AAG AAA ACA GCA CTC TGG ATT 255
Thr Gln Glu Tyr His Leu Tyr Arg Glu Lys Lys Thr Ala Leu Trp Ile
25 30 35

ACA CGG ATC CCA CAG GAG CTT GTG AAG AAG GGC CAG TTC CCC ATC CTA 303
Thr Arg Ile Pro Gln Glu Leu Val Lys Lys Gly Gln Phe Pro Ile Leu
40 45 50

TCC ATC ACC TGG GAA CAT GCA GGG CGG TAT TGC TGT ATC TAT GGC AGC 351
Ser Ile Thr Trp Glu His Ala Gly Arg Tyr Cys Cys Ile Tyr Gly Ser
55 60 65

CAC ACT GCA GGC CTC TCA GAG AGC AGT GAC CCC CTG GAG CTG GTG GTG 399 
His Thr Ala Gly Leu Ser Glu Ser Ser Asp Pro Leu Glu Leu Val Val 
70 75 80 85 

ACA GGA GCC TAC AGC AAA CCC ACC CTC TCA GCT CTG CCC AGC CCT GTG 447 
Thr Gly Ala Tyr Ser Lys Pro Thr Leu Ser Ala Leu Pro Ser Pro Val 
90 95 100 

GTG ACC TCA GGA GGG AAT GTG ACC ATC CAG TGT GAC TCA CAG GTG GCA 495 
Val Thr Ser Gly Gly Asn Val Thr Ile Gln Cys Asp Ser Gln Val Ala
105 110 115

TTT GAT GGC TTC ATT CTG TGT AAG GAA GGA GAA GAT GAA CAC CCA CAA 543
Phe Asp Gly Phe Ile Leu Cys Lys Glu Gly Glu Asp Glu His Pro Gln
120 125 130

TGC CTG AAC TCC CAT TCC CAT GCC CGT GGG TCA TCC CGG GCC ATC TTC 591
Cys Leu Asn Ser His Ser His Ala Arg Gly Ser Ser Arg Ala Ile Phe
135 140 145

TCC GTG GGC CCC GTG AGC CCA AGT CGC AGG TGG TCG TAC AGG TGC TAT 639
Ser Val Gly Pro Val Ser Pro Ser Arg Arg Trp Ser Tyr Arg Cys Tyr
150 155 160 165

GGT TAT GAC TCG CGC GCT CCC TAT GTG TGG TCT CTA CCC AGT GAT CTC 687
Gly Tyr Asp Ser Arg Ala Pro Tyr Val Trp Ser Leu Pro Ser Asp Leu
170 175 180

CTG GGG CTC CTG GTC CCA GGT GTT TCT AAG AAG CCA TCA CTC TCA GTG 735 
Leu Gly Leu Leu Val Pro Gly Val Ser Lys Lys Pro Ser Leu Ser Val 
185 190 195 

CAG CCG GGT CCT GTC GTG GCC CCT GGG GAG AAG CTG ACC TTC CAG TGT 783 
Gln Pro Gly Pro Val Val Ala Pro Gly Glu Lys Leu Thr Phe Gln Cys
200 205 210

GGC TCT GAT GCC GGC TAC GAC AGA TTT GTT CTG TAC AAG GAG TGG GGA 831
Gly Ser Asp Ala Gly Tyr Asp Arg Phe Val Leu Tyr Lys Glu Trp Gly
215 220 225

CCT GAC TTC CTC CAG CGC CCT GGC CGG CAG CCC CAG GCT GGG CTC TCC 879
Arg Asp Phe Leu Gln Arg Pro Gly Arg Gln Pro Gln Ala Gly Leu Ser
230 235 240 245

CAG GCC AAC TTC ACC CTG GGC CCT GTG AGC CGC TCC TAC GGG GGC CAG 927
Gln Ala Asn Phe Thr Leu Gly Pro Val Ser Arg Ser Tyr Gly Gly Gln
250 255 260 

TAC ACA TGC TCC GGT GCA TAC AAC CTC TCC TCC GAG TGG TCG GCC CCC 975 
Tyr Thr Cys Ser Gly Ala Tyr Asn Leu Ser Ser Glu Trp Ser Ala Pro 
265 270 275 

AGC GAC CCC CTG GAC ATC CTG ATC ACA GGA CAG ATC CGT GCC AGA CCC 1023 
Ser Asp Pro Leu Asp Ile Leu Ile Thr Gly Gln Ile Arg Ala Arg Pro
280 285 290 

TTC CTC TCC GTG CGG CCG GGC CCC ACA GTG GCC TCA GGA GAG AAC GTG 1071 
Phe Leu Ser Val Arg Pro Gly Pro Thr Val Ala Ser Gly Glu Asn Val 
295 300 305 

ACC CTG CTG TGT CAG TCA CAG GGA GGG ATG CAC ACT TTC CTT TTG ACC 1119 
Thr Leu Leu Cys Gln Ser Gln Gly Gly Met His Thr Phe Leu Leu Thr
310 315 320 325

AAG GAG GGG GCA GCT GAT TCC CCG CTG CGT CTA AAA TCA AAG CGC CAA 1167
Lys Glu Gly Ala Ala Asp Ser Pro Leu Arg Leu Lys Ser Lys Arg Gln
330 335 340

TCT CAT AAG TAC CAG GCT GAA TTC CCC ATG ACT CCT GTG ACC TCG GCC 1215
Ser His Lys Tyr Gln Ala Glu Phe Pro Met Ser Pro Val Thr Ser Ala
345 350 355 

CAC GCG GGG ACC TAC AGG TGC TAC GGC TCA CTC AGC TCC AAC CCC TAC 1263 
His Ala Gly Thr Tyr Arg Cys Tyr Gly Ser Leu Ser Ser Asn Pro Tyr 
360 365 370 

CTG CTG ACT CAC CCC AGT GAC CCC CTG GAG CTC GTG GTC TCA GGA GCA 1311 
Leu Leu Thr His Pro Ser Asp Pro Leu Glu Leu Val Val Ser Gly Ala 
375 380 385 

GCT GAG ACC CTC AGC CCA CCA CAA AAC AAG TCC GAC TCC AAG GCT GGT 1359 
Ala Glu Thr Leu Ser Pro Pro Gln Asn Lys Ser Asp Ser Lys Ala Gly
390 395 400 405

GAG TGAGGAGATG CTTGCCGTGA TGACGCTGGG CACAGAGGGT CAGGTCCTGT 1412
Glu

CAAGAGGAGC TGGGTGTCCT GGGTGGACAT TTGAAGAATT ATATTCATTC CAACTTGAAG 1472

AATTATTCAA CACCTTTAAC AATGTATATG TGAAGTACTT TATTCTTTCA TATTTTAAAA 1532

ATAAAAGATA ATTATCCATG 1552



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 427 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Thr Pro Ile Leu Thr Val Leu Ile Cys Leu Gly Pro Leu Pro Lys
-21 -20 -15 -10

Pro Thr Leu Trp Ala Glu Pro Gly Ser Val Ile Thr Gln Gly Ser Pro
-5 1 5 10

Val Thr Leu Arg Cys Gln Gly Ser Leu Glu Thr Gln Glu Tyr His Leu
15 20 25

Tyr Arg Glu Lys Lys Thr Ala Leu Trp Ile Thr Arg Ile Pro Gln Glu
30 35 40

Leu Val Lys Lys Gly Gln Phe Pro Ile Leu Ser Ile Thr Trp Glu His
45 50 55

Ala Gly Arg Tyr Cys Cys Ile Tyr Gly Ser His Thr Ala Gly Leu Ser
60 65 70 75

Glu Ser Ser Asp Pro Leu Glu Leu Val Val Thr Gly Ala Tyr Ser Lys 
80 85 90 

Pro Thr Leu Ser Ala Leu Pro Ser Pro Val Val Thr Ser Gly Gly Asn 
95 100 105 

Val Thr Ile Gln Cys Asp Ser Gln Val Ala Phe Asp Gly Phe Ile Leu
110 115 120

Cys Lys Glu Gly Glu Asp Glu His Pro Gln Cys Leu Asn Ser His Ser
125 130 135

His Ala Arg Gly Ser Ser Arg Ala Ile Phe Ser Val Gly Pro Val Ser
140 145 150 155

Pro Ser Arg Arg Trp Ser Tyr Arg Cys Tyr Gly Tyr Asp Ser Arg Ala 
160 165 170 

Pro Tyr Val Trp Ser Leu Pro Ser Asp Leu Leu Gly Leu Leu Val Pro 
175 180 185 

Gly Val Ser Lys Lys Pro Ser Leu Ser Val Gln Pro Gly Pro Val Val
190 195 200

Ala Pro Gly Glu Lys Leu Thr Phe Gln Cys Gly Ser Asp Ala Gly Tyr
205 210 215

Asp Arg Phe Val Leu Tyr Lys Glu Trp Gly Arg Asp Phe Leu Gln Arg
220 225 230 235

Pro Gly Arg Gln Pro Gln Ala Gly Leu Ser Gln Ala Asn Phe Thr Leu
240 245 250

Gly Pro Val Ser Arg Ser Tyr Gly Gly Gln Tyr Thr Cys Ser Gly Ala
255 260 265

Tyr Asn Leu Ser Ser Glu Trp Ser Ala Pro Ser Asp Pro Leu Asp Ile
270 275 280

Leu Ile Thr Gly Gln Ile Arg Ala Arg Pro Phe Leu Ser Val Arg Pro
285 290 295

Gly Pro Thr Val Ala Ser Gly Glu Asn Val Thr Leu Leu Cys Gln Ser
300 305 310 315

Gln Gly Gly Met His Thr Phe Leu Leu Thr Lys Glu Gly Ala Ala Asp
320 325 330

Ser Pro Leu Arg Leu Lys Ser Lys Arg Gln Ser His Lys Tyr Gln Ala
335 340 345

Glu Phe Pro Met Ser Pro Val Thr Ser Ala His Ala Gly Thr Tyr Arg 
350 355 360 

Cys Tyr Gly Ser Leu Ser Ser Asn Pro Tyr Leu Leu Thr His Pro Ser 
365 370 375 

Asp Pro Leu Glu Leu Val Val Ser Gly Ala Ala Glu Thr Leu Ser Pro 
380 385 390 395 

Pro Gln Asn Lys Ser Asp Ser Lys Ala Gly Glu
400 405 

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1410 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 37..826

(ix) FEATURE:

(A) NAME/KEY: sig_peptide

(B) LOCATION: 37..88

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 91..826



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 



ACCCACGCGT CCGCACTCTA GCGGTATCTG CCCACC ATG GCC CTG GTG CTG ATC 54
Met Ala Leu Val Leu Ile
-18 -15

CTC CAG CTG CTG ACC CTC TGG CCT CTG TGT CAC ACA GAC ATC ACT CCG 102
Leu Gln Leu Leu Thr Leu Trp Pro Leu Cys His Thr Asp Ile Thr Pro
-10 -5 1

TCT GTC CCC CCA GCT TCA TAC CAC CCT AAG CCA TGG CTG GGA GCT CAG 150
Ser Val Pro Pro Ala Ser Tyr His Pro Lys Pro Trp Leu Gly Ala Gln
5 10 15 20

CCG GCT ACA GTT GTG ACC CCT GGG GTC AAC GTG ACC TTG AGA TGC CGG 198
Pro Ala Thr Val Val Thr Pro Gly Val Asn Val Thr Leu Arg Cys Arg
25 30 35

GCA CCC CAA CCC GCT TGG AGA TTT GGA CTT TTC AAG CCT GGA GAG ATC 246
Ala Pro Gln Pro Ala Trp Arg Phe Gly Leu Phe Lys Pro Gly Glu Ile
40 45 50

GCT CCC CTT CTC TTC CGG GAT GTG TCC TCC GAG CTG GCA GAA TTC TTT 294
Ala Pro Leu Leu Phe Arg Asp Val Ser Ser Glu Leu Ala Glu Phe Phe
55 60 65

CTG GAG GAG GTG ACT CCA GCC CAA GGG GGA AGT TAC CGC TGC TGC TAC 342
Leu Glu Glu Val Thr Pro Ala Gln Gly Gly Ser Tyr Arg Cys Cys Tyr
70 75 80

CGA AGG CCA GAC TGG GGG CCG GGT GTC TGG TCC CAG CCC AGC GAT GTC 390
Arg Arg Pro Asp Trp Gly Pro Gly Val Trp Ser Gln Pro Ser Asp Val
85 90 95 100

CTG GAG CTG CTG GTG ACA GAG GAG CTG CCG CGG CCG TCG CTG GTG GCG 438
Leu Glu Leu Leu Val Thr Glu Glu Leu Pro Arg Pro Ser Leu Val Ala
105 110 115

CTG CCC GGG CCG GTG GTG GGT CCT GGC GCC AAC GTG AGC CTG CGC TGC 486
Leu Pro Gly Pro Val Val Gly Pro Gly Ala Asn Val Ser Leu Arg Cys
120 125 130

GCG GGC CGC CTG CGG AAC ATG AGC TTC GTG CTG TAC CGC GAG GGC GTG 534
Ala Gly Arg Leu Arg Asn Met Ser Phe Val Leu Tyr Arg Glu Gly Val
135 140 145

GCG GCC CCG CTG CAG TAC CGC CAC TCC GCG CAG CCC TGG GCC GAC TTC 582
Ala Ala Pro Leu Gln Tyr Arg His Ser Ala Gln Pro Trp Ala Asp Phe
150 155 160

ACG CTG CTG GGC GCC CGC GCC CCC GGC ACC TAC AGC TGC TAC TAT CAC 630
Thr Leu Leu Gly Ala Arg Ala Pro Gly Thr Tyr Ser Cys Tyr Tyr His
165 170 175 180

ACG CCC TCC GCG CCC TAC GTG CTG TCG CAG CGC AGC GAG GTG CTG GTC 678
Thr Pro Ser Ala Pro Tyr Val Leu Ser Gln Arg Ser Glu Val Leu Val
185 190 195

ATC AGC TGG GAA GAC TCT GGC TCC TCC GAC TAC ACC CGG GGG AAC CTA 726
Ile Ser Trp Glu Asp Ser Gly Ser Ser Asp Tyr Thr Arg Gly Asn Leu
200 205 210

GTC CGC CTG GGG CTG GCC GGG CTG GTC CTC ATC TCC CTG GGC GCG CTG 774
Val Arg Leu Gly Leu Ala Gly Leu Val Leu Ile Ser Leu Gly Ala Leu
215 220 225

GTC ACT TTT GAC TGG CGC AGT CAG AAC CGC GCT CCT GCT GGT ATC CGC 822
Val Thr Phe Asp Trp Arg Ser Gln Asn Arg Ala Pro Ala Gly Ile Arg
230 235 240

CCC T GAGCCCCAGG AGCACTGCAG CCCGAGACTT CCAACCTGAG TGGCGGAGAA 876
Pro
245

GCTGGGACCC TGGGCTGGAC TGTCCTTTCC TGCAGCCCCA CAGTCCTGCT GGCTGAGCTC 936 

CGCGGAACGG TCCTTAGACC CCGCTGTGCC CTGTGCTGTA GCTTCTTTCC AGGCCTTTCC 996 

CAAGGAGTAG CTGAAAGGAA GACGCGATTA GTGGTTAAGA CTTCCAAGCC AGAAGACAGA 1056 

GGGTTCGAAT CCCAGCACTG CCGTCTACTC ACTGTAGTAG TAGCAGCTAC AGAAAGGTAG 1116 

TAGTGAGACG TGAAGCCAGC TGGACTTCCT GGGTTGAATG GGGACCTGGA GAACTTTTCT 1176 

GTCTTACAAG AGGATTGTAA AATGGACCAA TCAGCACTCT GTAAGATGGA CCAATCAGCG 1236 

CTCTGTAAAA TGGACCAATC AGCAGGACAT GGGCGGGGAC AATAAGGGAA TAAAAGCTGG 1296 

CGAGCGCGGC ACCCCACCAG AGTCTGCTTC CACGCTGTGG GAGCTTTGTT CTCTTGCTCT 1356 

ACACAATAAA TCTTGCTGCT GCTAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAA 1410 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Ala Leu Val Leu Ile Leu Gln Leu Leu Thr Leu Trp Pro Leu Cys
-18 -15 -10 -5

His Thr Asp Ile Thr Pro Ser Val Pro Pro Ala Ser Tyr His Pro Lys
1 5 10

Pro Trp Leu Gly Ala Gln Pro Ala Thr Val Val Thr Pro Gly Val Asn
15 20 25 30

Val Thr Leu Arg Cys Arg Ala Pro Gln Pro Ala Trp Arg Phe Gly Leu
35 40 45

Phe Lys Pro Gly Glu Ile Ala Pro Leu Leu Phe Arg Asp Val Ser Ser
50 55 60

Glu Leu Ala Glu Phe Phe Leu Glu Glu Val Thr Pro Ala Gln Gly Gly
65 70 75

Ser Tyr Arg Cys Cys Tyr Arg Arg Pro Asp Trp Gly Pro Gly Val Trp 
80 85 90 

Ser Gln Pro Ser Asp Val Leu Glu Leu Leu Val Thr Glu Glu Leu Pro
95 100 105 110

Arg Pro Ser Leu Val Ala Leu Pro Gly Pro Val Val Gly Pro Gly Ala
115 120 125

Asn Val Ser Leu Arg Cys Ala Gly Arg Leu Arg Asn Met Ser Phe Val
130 135 140

Leu Tyr Arg Glu Gly Val Ala Ala Pro Leu Gln Tyr Arg His Ser Ala
145 150 155

Gln Pro Trp Ala Asp Phe Thr Leu Leu Gly Ala Arg Ala Pro Gly Thr
160 165 170

Tyr Ser Cys Tyr Tyr His Thr Pro Ser Ala Pro Tyr Val Leu Ser Gln
175 180 185 190

Arg Ser Glu Val Leu Val Ile Ser Trp Glu Asp Ser Gly Ser Ser Asp
195 200 205

Tyr Thr Arg Gly Asn Leu Val Arg Leu Gly Leu Ala Gly Leu Val Leu 
210 215 220 

Ile Ser Leu Gly Ala Leu Val Thr Phe Asp Trp Arg Ser Gln Asn Arg
225 230 235

Ala Pro Ala Gly Ile Arg Pro
240 245 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1991 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 73..1942

(ix) FEATURE:

(A) NAME/KEY: sig_peptide

(B) LOCATION: 73..118

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 121..1942



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GCAGGAATTC GGCACGAGCA GCACTGAGGG CTCATCCCTC TGCAGAGCGC GGGGTCACCG 60

GAAGGAGACG CC ATG ACG CCC GCC CTC ACA GCC CTG CTC TGC CTT GGG 108
Met Thr Pro Ala Leu Thr Ala Leu Leu Cys Leu Gly
-16 -15 -10 -5

CTG AGT CTG GGC CCC AGG ACC CGC GTG CAG GCA GGG CCC TTC CCC AAA 156
Leu Ser Leu Gly Pro Arg Thr Arg Val Gln Ala Gly Pro Phe Pro Lys
1 5 10

CCC ACC CTC TGG GCT GAG CCA GGC TCT GTG ATC AGC TGG GGG AGC CCC 204
Pro Thr Leu Trp Ala Glu Pro Gly Ser Val Ile Ser Trp Gly Ser Pro
15 20 25

GTG ACC ATC TGG TGT CAG GGG AGC CTG GAG GCC CAG GAG TAC CAA CTG 252
Val Thr Ile Trp Cys Gln Gly Ser Leu Glu Ala Gln Glu Tyr Gln Leu
30 35 40

GAT AAA GAG GGA AGC CCA GAG CCC TTG GAC AGA AAT AAC CCA CTG GAA 300
Asp Lys Glu Gly Ser Pro Glu Pro Leu Asp Arg Asn Asn Pro Leu Glu
45 50 55 60

CCC AAG AAC AAG GCC AGA TTC TCC ATC CCA TCC ATG ACA CAG CAC CAT 348
Pro Lys Asn Lys Ala Arg Phe Ser Ile Pro Ser Met Thr Gln His His
65 70 75

GCA GGG AGA TAC CGC TGC CAC TAT TAC AGC TCT GCA GGC TGG TCA GAG 396
Ala Gly Arg Tyr Arg Cys His Tyr Tyr Ser Ser Ala Gly Trp Ser Glu
80 85 90

CCC AGC GAC CCC CTG GAG CTG GTG ATG ACA GGA GCC TAT AGC AAA CCC 444
Pro Ser Asp Pro Leu Glu Leu Val Met Thr Gly Ala Tyr Ser Lys Pro
95 100 105

ACC CTC TCA GCC CTG CCC AGC CCT GTG GTG GCC TCA GGG GGG AAT ATG 492
Thr Leu Ser Ala Leu Pro Ser Pro Val Val Ala Ser Gly Gly Asn Met
110 115 120

ACC CTC CGA TGT GGC TCA CAG AAG AGA TAT CAC CAT TTT GTT CTG ATG 540
Thr Leu Arg Cys Gly Ser Gln Lys Arg Tyr His His Phe Val Leu Met
125 130 135 140

AAG GAA GGA GAA CAC CAG CTC CCC CGG ACC CTG GAC TCA CAG CAG CTC 588
Lys Glu Gly Glu His Gln Leu Pro Arg Thr Leu Asp Ser Gln Gln Leu
145 150 155

CAC AGT GGG GGG TTC CAG GCC CTG TTC CCT GTG GGC CCC GTG AAC CCC 636
His Ser Gly Gly Phe Gln Ala Leu Phe Pro Val Gly Pro Val Asn Pro
160 165 170

AGC CAC AGG TGG AGG TTC ACA TGC TAT TAC TAT TAT ATG AAC ACC CCC 684
Ser His Arg Trp Arg Phe Thr Cys Tyr Tyr Tyr Tyr Met Asn Thr Pro
175 180 185

CGG GTG TGG TCC CAC CCC AGT GAC CCC CTG GAG ATT CTG CCC TCA GGC 732
Arg Val Trp Ser His Pro Ser Asp Pro Leu Glu Ile Leu Pro Ser Gly
190 195 200

GTG TCT AGG AAG CCC TCC CTC CTG ACC CTG CAG GGC CCT GTC CTG GCC 780
Val Ser Arg Lys Pro Ser Leu Leu Thr Leu Gln Gly Pro Val Leu Ala
205 210 215 220

CCT GGG CAG AGC CTG ACC CTC CAG TGT GGC TCT GAT GTC GGC TAC GAC 828
Pro Gly Gln Ser Leu Thr Leu Gln Cys Gly Ser Asp Val Gly Tyr Asp
225 230 235

AGA TTT GTT CTG TAT AAG GAG GGG GAA CGT GAC TTC CTC CAG CGC CCT 876
Arg Phe Val Leu Tyr Lys Glu Gly Glu Arg Asp Phe Leu Gln Arg Pro
240 245 250

GGC CAG CAG CCC CAG GCT GGG CTC TCC CAG GCC AAC TTC ACC CTG GGC 924
Gly Gln Gln Pro Gln Ala Gly Leu Ser Gln Ala Asn Phe Thr Leu Gly
255 260 265

CCT GTG AGC CCC TCC AAT GGG GGC CAG TAC AGG TGC TAC GGT GGA CAC 972
Pro Val Ser Pro Ser Asn Gly Gly Gln Tyr Arg Cys Tyr Gly Ala His
270 275 280

AAC CTC TCC TCC GAG TGG TCG GCC CCC AGC GAC CCC CTG AAC ATC CTG 1020
Asn Leu Ser Ser Glu Trp Ser Ala Pro Ser Asp Pro Leu Asn Ile Leu
285 290 295 300

ATC GCA GGA CAG ATC TAT GAC ACC GTC TCC CTG TCA GCA CAG CCG GGC 1068
Met Ala Gly Gln Ile Tyr Asp Thr Val Ser Leu Ser Ala Gln Pro Gly
305 310 315

CCC ACA GTG GCC TCA GGA GAG AAC GTG ACC CTG CTG TGT CAG TCA TGG 1116
Pro Thr Val Ala Ser Gly Glu Asn Val Thr Leu Leu Cys Gln Ser Trp
320 325 330

TGG CAG TTT GAC ACT TTC CTT CTG ACC AAA GAA GGG GCA GCC CAT CCC 1164
Trp Gln Phe Asp Thr Phe Leu Leu Thr Lys Glu Gly Ala Ala His Pro
335 340 345

CCA CTG CGT CTG AGA TCA ATG TAC GGA GCT CAT AAG TAC CAG GCT GAA 1212
Pro Leu Arg Leu Arg Ser Met Tyr Gly Ala His Lys Tyr Gln Ala Glu
350 355 360

TTC CCC ATC AGT CCT GTG ACC TCA GCC CAC GCG GGG ACC TAC AGG TGC 1260
Phe Pro Met Ser Pro Val Thr Ser Ala His Ala Gly Thr Tyr Arg Cys
365 370 375 380

TAC GGC TCA CGC AGC TCC AAC CCC TAC CTG CTG TCT CAC CCC AGT GAG 1308
Tyr Gly Ser Arg Ser Ser Asn Pro Tyr Leu Leu Ser His Pro Ser Glu
385 390 395

CCC CTG GAG CTC GTG GTC TCA GGA CAC TCT GGA GGC TCC AGC CTC CCA 1356
Pro Leu Glu Leu Val Val Ser Gly His Ser Gly Gly Ser Ser Leu Pro
400 405 410

CCC ACA GGG CCG CCC TCC ACA CCT GGT CTG GGA AGA TAC CTG GAG GTT 1404
Pro Thr Gly Pro Pro Ser Thr Pro Gly Leu Gly Arg Tyr Leu Glu Val
415 420 425

TTG ATT GGG GTC TCG GTG GCC TTC GTC CTG CTG CTC TTC CTC CTC CTC 1452
Leu Ile Gly Val Ser Val Ala Phe Val Leu Leu Leu Phe Leu Leu Leu
430 435 440

TTC CTC CTC CTC CGA CGT CAG CGT CAC AGC AAA CAC AGG ACA TCT GAC 1500
Phe Leu Leu Leu Arg Arg Gln Arg His Ser Lys His Arg Thr Ser Asp
445 450 455 460

CAG AGA AAG ACT GAT TTC CAG CGT CCT GCA GGG GCT GCG GAG ACA GAG 1548
Gln Arg Lys Thr Asp Phe Gln Arg Pro Ala Gly Ala Ala Glu Thr Glu
465 470 475

CCC AAG GAC AGG GGC CTG CTG AGG AGG TCC AGC CCA GCT GCT GAC GTC 1596
Pro Lys Asp Arg Gly Leu Leu Arg Arg Ser Ser Pro Ala Ala Asp Val
480 485 490

CAG GAA GAA AAC CTC TAT GCT GCC GTG AAG GAC ACA CAG TCT GAG GAC 1644
Gln Glu Glu Asn Leu Tyr Ala Ala Val Lys Asp Thr Gln Ser Glu Asp
495 500 505

GGG GTG GAG CTG GAC AGT CAG AGC CCA CAC GAT GAA GAC CCC CAC GCA 1692
Gly Val Glu Leu Asp Ser Gln Ser Pro His Asp Glu Asp Pro His Ala
510 515 520

GTG ACG TAT GCC CCG GTG AAA CAC TCC AGT CCT AGG AGA GAA ATG GCC 1740
Val Thr Tyr Ala Pro Val Lys His Ser Ser Pro Arg Arg Glu Met Ala
525 530 535 540

TCT CCT CCT TCC CCA CTG TCT GGG GAA TTC CTG GAC ACA AAG GAC AGA 1788
Ser Pro Pro Ser Pro Leu Ser Gly Glu Phe Leu Asp Thr Lys Asp Arg
545 550 555

CAG GCA GAA GAG GAC AGA CAG ATG GAC ACT GAG GCT GCT GCA TCT GAA 1836
Gln Ala Glu Glu Asp Arg Gln Met Asp Thr Glu Ala Ala Ala Ser Glu
560 565 570

GCC TCC CAG GAT GTG ACC TAC GCC CAG CTG CAC AGC TTG ACC CTT AGA 1884
Ala Ser Gln Asp Val Thr Tyr Ala Gln Leu His Ser Leu Thr Leu Arg
575 580 585

CGG AAG GCA ACT GAG CCT CCT CCA TCC CAG GAG TTC GAG TCA GTC AGA 1932
Arg Lys Ala Thr Glu Pro Pro Pro Ser Gln Glu Phe Glu Ser Val Arg
590 595 600

TCA GCA TTG T GAGGCCCCAT CTCTACAAAA AATAAAACCA GTCCGGCGTG GTGGCACAA 1991
Ser Ala Leu
605



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 623 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Thr Pro Ala Leu Thr Ala Leu Leu Cys Leu Gly Leu Ser Leu Gly 
-16 -15 -10 -5 

Pro Arg Thr Arg Val Gln Ala Gly Pro Phe Pro Lys Pro Thr Leu Trp
1 5 10 15

Ala Glu Pro Gly Ser Val Ile Ser Trp Gly Ser Pro Val Thr Ile Trp
20 25 30

Cys Gln Gly Ser Leu Glu Ala Gln Glu Tyr Gln Leu Asp Lys Glu Gly
35 40 45

Ser Pro Glu Pro Leu Asp Arg Asn Asn Pro Leu Glu Pro Lys Asn Lys
50 55 60

Ala Arg Phe Ser Ile Pro Ser Met Thr Gln His His Ala Gly Arg Tyr
65 70 75 80

Arg Cys His Tyr Tyr Ser Ser Ala Gly Trp Ser Glu Pro Ser Asp Pro 
85 90 95 

Leu Glu Leu Val Met Thr Gly Ala Tyr Ser Lys Pro Thr Leu Ser Ala 
100 105 110

Leu Pro Ser Pro Val Val Ala Ser Gly Gly Asn Met Thr Leu Arg Cys
115 120 125

Gly Ser Gln Lys Arg Tyr His His Phe Val Leu Met Lys Glu Gly Glu
130 135 140

His Gln Leu Pro Arg Thr Leu Asp Ser Gln Gln Leu His Ser Gly Gly
145 150 155 160

Phe Gln Ala Leu Phe Pro Val Gly Pro Val Asn Pro Ser His Arg Trp
165 170 175

Arg Phe Thr Cys Tyr Tyr Tyr Tyr Met Asn Thr Pro Arg Val Trp Ser 
180 185 190 

His Pro Ser Asp Pro Leu Glu Ile Leu Pro Ser Gly Val Ser Arg Lys
195 200 205

Pro Ser Leu Leu Thr Leu Gln Gly Pro Val Leu Ala Pro Gly Gln Ser
210 215 220

Leu Thr Leu Gln Cys Gly Ser Asp Val Gly Tyr Asp Arg Phe Val Leu
225 230 235 240

Tyr Lys Glu Gly Glu Arg Asp Phe Leu Gln Arg Pro Gly Gln Gln Pro
245 250 255

Gln Ala Gly Leu Ser Gln Ala Asn Phe Thr Leu Gly Pro Val Ser Pro
260 265 270

Ser Asn Gly Gly Gln Tyr Arg Cys Tyr Gly Ala His Asn Leu Ser Ser
275 280 285

Glu Trp Ser Ala Pro Ser Asp Pro Leu Asn Ile Leu Met Ala Gly Gln
290 295 300

Ile Tyr Asp Thr Val Ser Leu Ser Ala Gln Pro Gly Pro Thr Val Ala
305 310 315 320

Ser Gly Glu Asn Val Thr Leu Leu Cys Gln Ser Trp Trp Gln Phe Asp
325 330 335

Thr Phe Leu Leu Thr Lys Glu Gly Ala Ala His Pro Pro Leu Arg Leu 
340 345 350 

Arg Ser Met Tyr Gly Ala His Lys Tyr Gln Ala Glu Phe Pro Met Ser
355 360 365 

Pro Val Thr Ser Ala His Ala Gly Thr Tyr Arg Cys Tyr Gly Ser Arg 
370 375 380 

Ser Ser Asn Pro Tyr Leu Leu Ser His Pro Ser Glu Pro Leu Glu Leu 
385 390 395 400 

Val Val Ser Gly His Ser Gly Gly Ser Ser Leu Pro Pro Thr Gly Pro 
405 410 415 

Pro Ser Thr Pro Gly Leu Gly Arg Tyr Leu Glu Val Leu Ile Gly Val
420 425 430 

Ser Val Ala Phe Val Leu Leu Leu Phe Leu Leu Leu Phe Leu Leu Leu 
435 440 445 

Arg Arg Gln Arg His Ser Lys His Arg Thr Ser Asp Gln Arg Lys Thr
450 455 460

Asp Phe Gln Arg Pro Ala Gly Ala Ala Glu Thr Glu Pro Lys Asp Arg
465 470 475 480

Gly Leu Leu Arg Arg Ser Ser Pro Ala Ala Asp Val Gln Glu Glu Asn
485 490 495

Leu Tyr Ala Ala Val Lys Asp Thr Gln Ser Glu Asp Gly Val Glu Leu
500 505 510

Asp Ser Gln Ser Pro His Asp Glu Asp Pro His Ala Val Thr Tyr Ala
515 520 525

Pro Val Lys His Ser Ser Pro Arg Arg Glu Met Ala Ser Pro Pro Ser 
530 535 540 

Pro Leu Ser Gly Glu Phe Leu Asp Thr Lys Asp Arg Gln Ala Glu Glu
545 550 555 560

Asp Arg Gln Met Asp Thr Glu Ala Ala Ala Ser Glu Ala Ser Gln Asp
565 570 575

Val Thr Tyr Ala Gln Leu His Ser Leu Thr Leu Arg Arg Lys Ala Thr
580 585 590

Glu Pro Pro Pro Ser Gln Glu Phe Glu Ser Val Arg Ser Ala Leu
595 600 605 

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1550 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 22..1438

(ix) FEATURE:

(A) NAME/KEY: sig_peptide

(B) LOCATION: 22..67

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 70..1438



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

GGAATTCCGG GAGGAGACGC C ATG ATC CCC ACC TTC ACG GCT CTG CTC TGC 51
Met Ile Pro Thr Phe Thr Ala Leu Leu Cys
-16 -15 -10

CTC GGG CTG AGT CTG GGC CCC AGT ACC CAC ATG CAG GCA GGG CCC CTC 99
Leu Gly Leu Ser Leu Gly Pro Ser Thr His Met Gln Ala Gly Pro Leu
-5 1 5 10

CCC AAA CCC ACC CTC TGG GCT GAG CCA GGC TCT GTG ATC AGC TGG GGG 147
Pro Lys Pro Thr Leu Trp Ala Glu Pro Gly Ser Val Ile Ser Trp Gly
15 20 25

AAC TCT GTG ACC ATC TCG TGT CAG GGG ACC CTG GAG GCT CGG GAG TAC 195
Asn Ser Val Thr Ile Trp Cys Gln Gly Thr Leu Glu Ala Arg Glu Tyr
30 35 40

CGT CTG GAT AAA GAG GAA AGC CCA GCA CCC TGG GAC AGA CAG AAC CCA 243
Arg Leu Asp Lys Glu Glu Ser Pro Ala Pro Trp Asp Arg Gln Asn Pro
45 50 55

CTG GAG CCC AAG AAC AAG GCC AGA TTC TCC ATC CCA TCC ATG ACA GAG 291
Leu Glu Pro Lys Asn Lys Ala Arg Phe Ser Ile Pro Ser Met Thr Glu
60 65 70

GAC TAT GCA GGG AGA TAC CGC TGT TAC TAT CGC AGC CCT GTA GGC TGG 339
Asp Tyr Ala Gly Arg Tyr Arg Cys Tyr Tyr Arg Ser Pro Val Gly Trp
75 80 85 90

TCA CAG CCC AGT GAC CCC CTG GAG CTG GTG ATG ACA GGA GCC TAC AGT 387
Ser Gln Pro Ser Asp Pro Leu Glu Leu Val Met Thr Gly Ala Tyr Ser
95 100 105

AAA CCC ACC CTT TCA GCC CTG CCG AGT CCT CTT GTG ACC TCA GGA AAG 435
Lys Pro Thr Leu Ser Ala Leu Pro Ser Pro Leu Val Thr Ser Gly Lys
110 115 120

AGC GTG ACC CTG CTG TGT CAG TCA CGG AGC CCA ATG GAC ACT TTT CTT 483
Ser Val Thr Leu Leu Cys Gln Ser Arg Ser Pro Met Asp Thr Phe Leu
125 130 135

CTG ATC AAG GAG CGG GCA GCC CAT CCC CTA CTG CAT CTG AGA TCA GAG 531
Leu Ile Lys Glu Arg Ala Ala His Pro Leu Leu His Leu Arg Ser Glu
140 145 150

CAC GGA GCT CAG CAG CAC CAG GCT GAA TTC CCC ATG AGT CCT GTG ACC 579
His Gly Ala Gln Gln His Gln Ala Glu Phe Pro Met Ser Pro Val Thr
155 160 165 170

TCA GTG CAC GGG GGG ACC TAC AGG TGC TTC AGC TCA CAC GGC TTC TCC 627
Ser Val His Gly Gly Thr Tyr Arg Cys Phe Ser Ser His Gly Phe Ser
175 180 185

CAC TAC CTG CTG TCA CAC CCC AGT GAC CCC CTG GAG CTC ATA GTC TCA 675
His Tyr Leu Leu Ser His Pro Ser Asp Pro Leu Glu Leu Ile Val Ser
190 195 200

GGA TCC TTG GAG GGT CCC AGG CCC TCA CCC ACA AGG TCC GTC TCA ACA 723
Gly Ser Leu Glu Gly Pro Arg Pro Ser Pro Thr Arg Ser Val Ser Thr
205 210 215

GCT GCA GGC CCT GAG GAC CAG CCC CTC ATG CCT ACA GGG TCA GTC CCC 771
Ala Ala Gly Pro Glu Asp Gln Pro Leu Met Pro Thr Gly Ser Val Pro
220 225 230

CAC AGT GGT CTG AGA AGG CAC TGG GAG GTA CTG ATC GGG GTC TTG GTG 819
His Ser Gly Leu Arg Arg His Trp Glu Val Leu Ile Gly Val Leu Val
235 240 245 250

GTC TCC ATC CTG CTT CTC TCC CTC CTC CTC TTC
Val Ser Ile Leu Leu Leu Ser Leu Leu Leu Phe Leu Leu Leu Gln His
255 260 265

TGG CGT CAG GGA AAA CAC AGG ACA TTG GCC CAG AGA CAG GCT GAT TTC 915
Trp Arg Gln Gly Lys His Arg Thr Leu Ala Gln Arg Gln Ala Asp Phe
270 275 280

CAA CGT CCT CCA GGG GCT GCC GAG CCA GAG CCC AAG GAC GGG GGC CTA 963
Gln Arg Pro Pro Gly Ala Ala Glu Pro Glu Pro Lys Asp Gly Gly Leu
285 290 295

CAG AGG AGG TCC AGC CCA GCT GCT GAC GTC CAG GGA GAA AAC TTC TGT 1011
Gln Arg Arg Ser Ser Pro Ala Ala Asp Val Gln Gly Glu Asn Phe Cys
300 305 310

GCT GCC GTG AAG GAC ACA CAG CCT GAG GAC GGG GTG GAA ATG GAC ACT 1059
Ala Ala Val Lys Asp Thr Gln Pro Glu Asp Gly Val Glu Met Asp Thr
315 320 325 330

CGG CAG AGC CCA CAC GAT GAA GAC CCC CAG GCA GTG ACG TAT GCC AAG 1107
Arg Gln Ser Pro His Asp Glu Asp Pro Gln Ala Val Thr Tyr Ala Lys
335 340 345

GTG AAA CAC TCC AGA CCT AGG AGA GAA ATG GCC TCT CCT CCC TCC CCA 1155
Val Lys His Ser Arg Pro Arg Arg Glu Met Ala Ser Pro Pro Ser Pro
350 355 360

CTG TCT GGG GAA TTC CTG GAC ACA AAG GAC AGA CAG GCA GAA GAG GAC 1203
Leu Ser Gly Glu Phe Leu Asp Thr Lys Asp Arg Gln Ala Glu Glu Asp
365 370 375

AGA CAG ATG GAC ACT GAG GCT GCT GCA TCT GAA GCC CCC CAG GAT GTG 1251
Arg Gln Met Asp Thr Glu Ala Ala Ala Ser Glu Ala Pro Gln Asp Val
380 385 390

ACT ACG CCC GGC TGC ACA GCT TTA CCC TCA GAC AGA AGG CAA CTG AGC 1299
Thr Thr Pro Gly Cys Thr Ala Leu Pro Ser Asp Arg Arg Gln Leu Ser
395 400 405 410

CTC CTC CAT CCC AGG AAG GGG CCT CTC CAG CTG AGC CCA GTG TCT ATG 1347
Leu Leu His Pro Arg Lys Gly Pro Leu Gln Leu Ser Pro Val Ser Met
415 420 425

CCA CTC TGG CCA TCC ACT AAT CCA GGG GGG ACC CAG ACC CCA CAA GCC 1395
Pro Leu Trp Pro Ser Thr Asn Pro Gly Gly Thr Gln Thr Pro Gln Ala
430 435 440

ATG GAG ACT CAG GAC CCC AGA AGG CAT GGA AGC TGC CTC CAG T 1438
Met Glu Thr Gln Asp Pro Arg Arg His Gly Ser Cys Leu Gln
445 450 455

AGACATCACT GAACCCCAGC CAGCCCAGAC CCCTGACACA GACCACTAGA AGATTCCGGG 1498 

AACGTTGGGA GTCACCTGAT TCTGCAAAGA TAAATAATAT CCCTGCATTA TC 1550 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 472 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Ile Pro Thr Phe Thr Ala Leu Leu Cys Leu Gly Leu Ser Leu Gly
-16 -15 -10 -5

Pro Ser Thr His Met Gln Ala Gly Pro Leu Pro Lys Pro Thr Leu Trp
1 5 10 15

Ala Glu Pro Gly Ser Val Ile Ser Trp Gly Asn Ser Val Thr Ile Trp
20 25 30

Cys Gln Gly Thr Leu Glu Ala Arg Glu Tyr Arg Leu Asp Lys Glu Glu
35 40 45

Ser Pro Ala Pro Trp Asp Arg Gln Asn Pro Leu Glu Pro Lys Asn Lys
50 55 60

Ala Arg Phe Ser Ile Pro Ser Met Thr Glu Asp Tyr Ala Gly Arg Tyr
65 70 75 80

Arg Cys Tyr Tyr Arg Ser Pro Val Gly Trp Ser Gln Pro Ser Asp Pro
85 90 95

Leu Glu Leu Val Met Thr Gly Ala Tyr Ser Lys Pro Thr Leu Ser Ala 
100 105 110

Leu Pro Ser Pro Leu Val Thr Ser Gly Lys Ser Val Thr Leu Leu Cys
115 120 125

Gln Ser Arg Ser Pro Met Asp Thr Phe Leu Leu Ile Lys Glu Arg Ala
130 135 140

Ala His Pro Leu Leu His Leu Arg Ser Glu His Gly Ala Gln Gln His
145 150 155 160

Gln Ala Glu Phe Pro Met Ser Pro Val Thr Ser Val His Gly Gly Thr
165 170 175

Tyr Arg Cys Phe Ser Ser His Gly Phe Ser His Tyr Leu Leu Ser His 
180 185 190 

Pro Ser Asp Pro Leu Glu Leu Ile Val Ser Gly Ser Leu Glu Gly Pro
195 200 205 

Arg Pro Ser Pro Thr Arg Ser Val Ser Thr Ala Ala Gly Pro Glu Asp 
210 215 220 

Gln Pro Leu Met Pro Thr Gly Ser Val Pro His Ser Gly Leu Arg Arg
225 230 235 240

His Trp Glu Val Leu Ile Gly Val Leu Val Val Ser Ile Leu Leu Leu
245 250 255

Ser Leu Leu Leu Phe Leu Leu Leu Gln His Trp Arg Gln Gly Lys His
260 265 270

Arg Thr Leu Ala Gln Arg Gln Ala Asp Phe Gln Arg Pro Pro Gly Ala
275 280 285

Ala Glu Pro Glu Pro Lys Asp Gly Gly Leu Gln Arg Arg Ser Ser Pro
290 295 300

Ala Ala Asp Val Gln Gly Glu Asn Phe Cys Ala Ala Val Lys Asp Thr
305 310 315 320

Gln Pro Glu Asp Gly Val Glu Met Asp Thr Arg Gln Ser Pro His Asp
325 330 335

Glu Asp Pro Gln Ala Val Thr Tyr Ala Lys Val Lys His Ser Arg Pro
340 345 350

Arg Arg Glu Met Ala Ser Pro Pro Ser Pro Leu Ser Gly Glu Phe Leu 
355 360 365 

Asp Thr Lys Asp Arg Gln Ala Glu Glu Asp Arg Gln Met Asp Thr Glu
370 375 380

Ala Ala Ala Ser Glu Ala Pro Gln Asp Val Thr Thr Pro Gly Cys Thr
385 390 395 400

Ala Leu Pro Ser Asp Arg Arg Gln Leu Ser Leu Leu His Pro Arg Lys
405 410 415

Gly Pro Leu Gln Leu Ser Pro Val Ser Met Pro Leu Trp Pro Ser Thr
420 425 430

Asn Pro Gly Gly Thr Gln Thr Pro Gln Ala Met Glu Thr Gln Asp Pro
435 440 445 

Arg Arg His Gly Ser Cys Leu Gin 
450 455 

(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1657 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 46..1588

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 94..1588

(ix) FEATURE:

(A) NAME/KEY: sig_peptide

(B) LOCATION: 46..91



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



GCAGGAATTC GGCACGAGCA GAGCAGGGCA GTGGGAGGAG ACGCC ATG ACC CCC 54
Met Thr Pro
-16 -15

ATC CTC ACG GTC CTG ATC TGT CTC GGG CTG AGT CTG GGC CCC AGG ACC 102
Ile Leu Thr Val Leu Ile Cys Leu Gly Leu Ser Leu Gly Pro Arg Thr
-10 -5 1

CAC GTC CAG GCA GGG CAC CTC CCC AAG CCC ACC CTC TGG GCT GAG CCA 150
His Val Gln Ala Gly His Leu Pro Lys Pro Thr Leu Trp Ala Glu Pro
5 10 15

GGC TCT GTG ATC ATC CAG GGA AGT CCT GTG ACC CTC AGG TGT CAG GGG 198
Gly Ser Val Ile Ile Gln Gly Ser Pro Val Thr Leu Arg Cys Gln Gly
20 25 30 35

AGC CTT CAG GCT GAG GAG TAC CAT CTA TAT AGG GAA AAC AAA TCA GCA 246
Ser Leu Gln Ala Glu Glu Tyr His Leu Tyr Arg Glu Asn Lys Ser Ala
40 45 50

TCC TGG GTT AGA CGG ATA CAA GAG CCT GGG AAG AAT GGC CAG TTC CCC 294
Ser Trp Val Arg Arg Ile Gln Glu Pro Gly Lys Asn Gly Gln Phe Pro
55 60 65

ATC CCA TCC ATC ACC TGG GAA CAC GCA GGG CGG TAT CAC TGT CAG TAC 342
Ile Pro Ser Ile Thr Trp Glu His Ala Gly Arg Tyr His Cys Gln Tyr
70 75 80

TAC AGC CAC AAT CAC TCA TCA GAG TAC AGT GAC CCC CTG GAG CTG GTG 390
Tyr Ser His Asn His Ser Ser Glu Tyr Ser Asp Pro Leu Glu Leu Val
85 90 95

GTG ACA GGA GCC TAC AGC AAA CCC ACC CTC TCA GCT CTG CCC AGC CCT 438
Val Thr Gly Ala Tyr Ser Lys Pro Thr Leu Ser Ala Leu Pro Ser Pro
100 105 110 115

GTG GTG ACC TTA GGA GGG AAC GTG ACC CTC CAG TGT GTC TCA CAG GTG 486
Val Val Thr Leu Gly Gly Asn Val Thr Leu Gln Cys Val Ser Gln Val
120 125 130

GCA TTT GAC GGC TTC ATT CTG TGT AAG GAA GGA GAA GAT GAA CAC CCA 534
Ala Phe Asp Gly Phe Ile Leu Cys Lys Glu Gly Glu Asp Glu His Pro
135 140 145

CAA CGC CTG AAC TCC CAT TCC CAT GCC CGT GGG TGG TCC TGG GCC ATC 582
Gln Arg Leu Asn Ser His Ser His Ala Arg Gly Trp Ser Trp Ala Ile
150 155 160

TTC TCC GTG GGC CCC GTG AGC CCG AGT CGC AGG TGG TCG TAC AGG TGC 630
Phe Ser Val Gly Pro Val Ser Pro Ser Arg Arg Trp Ser Tyr Arg Cys
165 170 175

TAT GCT TAT GAC TCG AAC TCT CCC TAT GTG TGG TCT CTA CCC AGT GAT 678
Tyr Ala Tyr Asp Ser Asn Ser Pro Tyr Val Trp Ser Leu Pro Ser Asp
180 185 190 195

CTC CTG GAG CTC CTG GTC CCA GGT GTT TCT AAG AAG CCA TCA CTC TCA 726
Leu Leu Glu Leu Leu Val Pro Gly Val Ser Lys Lys Pro Ser Leu Ser
200 205 210

GTG CAG CCA GGT CCT ATG GTG GCC CCT GGG GAG AGC CTG ACC CTC CAG 774
Val Gln Pro Gly Pro Met Val Ala Pro Gly Glu Ser Leu Thr Leu Gln
215 220 225

TGT GTC TCT GAT GTC GGC TAC GAC AGA TTT GTT CTG TAT AAG GAG GGA 822
Cys Val Ser Asp Val Gly Tyr Asp Arg Phe Val Leu Tyr Lys Glu Gly
230 235 240

GAA CGT GAC TTC CTC CAG CGC CCT GGT TGG CAG CCC CAG GCT GGG CTC 870
Glu Arg Asp Phe Leu Gln Arg Pro Gly Trp Gln Pro Gln Ala Gly Leu
245 250 255

TCC CAG GCC AAC TTC ACC CTG GGC CCT GTG AGC CCC TCC CAC GGG GGC 918
Ser Gln Ala Asn Phe Thr Leu Gly Pro Val Ser Pro Ser His Gly Gly
260 265 270 275

CAG TAC AGA TGC TAC AGT GCA CAC AAC CTC TCC TCC GAG TGG TCG GCC 966
Gln Tyr Arg Cys Tyr Ser Ala His Asn Leu Ser Ser Glu Trp Ser Ala
280 285 290 

CCC AGT GAC CCC CTG GAC ATC CTG ATC ACA GGA CAG TTC TAT GAC AGA 1014
Pro Ser Asp Pro Leu Asp Ile Leu Ile Thr Gly Gln Phe Tyr Asp Arg
295 300 305

CCC TCT CTC TCG GTG CAG CCG GTC CCC ACA GTA GCC CCA GGA AAG AAC 1062
Pro Ser Leu Ser Val Gln Pro Val Pro Thr Val Ala Pro Gly Lys Asn
310 315 320

GTG ACC CTG CTG TGT CAG TCA CGG GGG CAG TTC CAC ACT TTC CTT CTG 1110
Val Thr Leu Leu Cys Gln Ser Arg Gly Gln Phe His Thr Phe Leu Leu
325 330 335

ACC AAG GAG GGG GCA GGC CAT CCC CCA CTG CAT CTG AGA TCA GAG CAC 1158
Thr Lys Glu Gly Ala Gly His Pro Pro Leu His Leu Arg Ser Glu His
340 345 350 355

CAA GCT CAG CAG AAC CAG GCT GAA TTC CGC ATG GGT CCT GTG ACC TCA 1206
Gln Ala Gln Gln Asn Gln Ala Glu Phe Arg Met Gly Pro Val Thr Ser
360 365 370

GCC CAC GTG GGG ACC TAC AGA TGC TAC AGC TCA CTC AGC TCC AAC CCC 1254
Ala His Val Gly Thr Tyr Arg Cys Tyr Ser Ser Leu Ser Ser Asn Pro
375 380 385

TAC CTG CTG TCT CTC CCC AGT GAC CCC CTG GAG CTC GTG GTC TCA GCA 1302
Tyr Leu Leu Ser Leu Pro Ser Asp Pro Leu Glu Leu Val Val Ser Ala
390 395 400

TCC CTA GGC CAA CAC CCC CAG GAT TAC ACA GTG GAG AAT CTC ATC CGC 1350
Ser Leu Gly Gln His Pro Gln Asp Tyr Thr Val Glu Asn Leu Ile Arg
405 410 415

ATG GGT GTG GCT GGC TTG GTC CTG GTG GTC CTC GGG ATT CTG CTA TTT 1398
Met Gly Val Ala Gly Leu Val Leu Val Val Leu Gly Ile Leu Leu Phe
420 425 430 435

GAG GCT CAG CAC AGC CAG AGA AGC CTA CAA GAT GCA GCC GGG AGT GAA 1446
Glu Ala Gln His Ser Gln Arg Ser Leu Gln Asp Ala Ala Gly Ser Glu
440 445 450

CAG CAG AGA GGA CAA TGC ATC CTT CAG CGT GGT GGA GCC TCA GGG ACA 1494
Gln Gln Arg Gly Gln Cys Ile Leu Gln Arg Gly Gly Ala Ser Gly Thr
455 460 465

GAT CTG ATG ATC CCA GGA GGC TCT GGA GGA CAA TCT AGG ACC TAC ATT 1542
Asp Leu Met Ile Pro Gly Gly Ser Gly Gly Gln Ser Arg Thr Tyr Ile
470 475 480

ATC TGG ACT GTA TGC TGG TCA TTT CTA GAG ACA GCA ATC AAT ATT T 1588
Ile Trp Thr Val Cys Trp Ser Phe Leu Glu Thr Ala Ile Asn Ile
485 490 495

GAGTGTAAGG AAACTGTCTG GGGTGATTCC TAGAAGATCA TTAAACTGTG GTACATTTTT 1648

TTGTCTATG 1657

(2) INFORMATION FOR SEQ ID NO:10:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Thr Pro Ile Leu Thr Val Leu Ile Cys Leu Gly Leu Ser Leu Gly
-16 -15 -10 -5

Pro Arg Thr His Val Gln Ala Gly His Leu Pro Lys Pro Thr Leu Trp
1 5 10 15

Ala Glu Pro Gly Ser Val Ile Ile Gln Gly Ser Pro Val Thr Leu Arg
20 25 30

Cys Gln Gly Ser Leu Gln Ala Glu Glu Tyr His Leu Tyr Arg Glu Asn
35 40 45

Lys Ser Ala Ser Trp Val Arg Arg Ile Gln Glu Pro Gly Lys Asn Gly
50 55 60

Gln Phe Pro Ile Pro Ser Ile Thr Trp Glu His Ala Gly Arg Tyr His
65 70 75 80

Cys Gln Tyr Tyr Ser His Asn His Ser Ser Glu Tyr Ser Asp Pro Leu
85 90 95

Glu Leu Val Val Thr Gly Ala Tyr Ser Lys Pro Thr Leu Ser Ala Leu
100 105 110

Pro Ser Pro Val Val Thr Leu Gly Gly Asn Val Thr Leu Gln Cys Val
115 120 125

Ser Gln Val Ala Phe Asp Gly Phe Ile Leu Cys Lys Glu Gly Glu Asp
130 135 140

Glu His Pro Gln Arg Leu Asn Ser His Ser His Ala Arg Gly Trp Ser
145 150 155 160

Trp Ala Ile Phe Ser Val Gly Pro Val Ser Pro Ser Arg Arg Trp Ser
165 170 175

Tyr Arg Cys Tyr Ala Tyr Asp Ser Asn Ser Pro Tyr Val Trp Ser Leu
180 185 190

Pro Ser Asp Leu Leu Glu Leu Leu Val Pro Gly Val Ser Lys Lys Pro
195 200 205

Ser Leu Ser Val Gln Pro Gly Pro Met Val Ala Pro Gly Glu Ser Leu
210 215 220

Thr Leu Gln Cys Val Ser Asp Val Gly Tyr Asp Arg Phe Val Leu Tyr
225 230 235 240

Lys Glu Gly Glu Arg Asp Phe Leu Gln Arg Pro Gly Trp Gln Pro Gln
245 250 255

Ala Gly Leu Ser Gln Ala Asn Phe Thr Leu Gly Pro Val Ser Pro Ser
260 265 270

His Gly Gly Gln Tyr Arg Cys Tyr Ser Ala His Asn Leu Ser Ser Glu
275 280 285

Trp Ser Ala Pro Ser Asp Pro Leu Asp Ile Leu Ile Thr Gly Gln Phe
290 295 300

Tyr Asp Arg Pro Ser Leu Ser Val Gln Pro Val Pro Thr Val Ala Pro
305 310 315 320

Gly Lys Asn Val Thr Leu Leu Cys Gln Ser Arg Gly Gln Phe His Thr
325 330 335

Phe Leu Leu Thr Lys Glu Gly Ala Gly His Pro Pro Leu His Leu Arg
340 345 350

Ser Glu His Gln Ala Gln Gln Asn Gln Ala Glu Phe Arg Met Gly Pro
355 360 365

Val Thr Ser Ala His Val Gly Thr Tyr Arg Cys Tyr Ser Ser Leu Ser
370 375 380

Ser Asn Pro Tyr Leu Leu Ser Leu Pro Ser Asp Pro Leu Glu Leu Val
385 390 395 400

Val Ser Ala Ser Leu Gly Gln His Pro Gln Asp Tyr Thr Val Glu Asn
405 410 415

Leu Ile Arg Met Gly Val Ala Gly Leu Val Leu Val Val Leu Gly Ile
420 425 430

Leu Leu Phe Glu Ala Gln His Ser Gln Arg Ser Leu Gln Asp Ala Ala
435 440 445

Gly Ser Glu Gln Gln Arg Gly Gln Cys Ile Leu Gln Arg Gly Gly Ala
450 455 460

Ser Gly Thr Asp Leu Met Ile Pro Gly Gly Ser Gly Gly Gln Ser Arg
465 470 475 480

Thr Tyr Ile Ile Trp Thr Val Cys Trp Ser Phe Leu Glu Thr Ala Ile
485 490 495

Asn Ile



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 264 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Met Ala Pro Thr Leu Pro Ala Leu Leu Cys Leu Gly Leu Ser Val Gly
1 5 10 15

Leu Arg Thr Gln Val Gln Ala Gly Thr Phe Pro Lys Pro Ile Ile Trp
20 25 30

Ala Glu Pro Ser Ser Val Val Pro Leu Gly Ser Ser Val Thr Ile Leu
35 40 45

Cys Gln Gly Pro Pro Asn Thr Lys Ser Phe Ser Leu Asn Lys Glu Gly
50 55 60

Asp Ser Thr Pro Trp Asn Ile His Pro Ser Leu Glu Pro Trp Asp Lys
65 70 75 80

Ala Asn Phe Phe Ile Ser Asn Val Arg Glu Gln Gln Ala Gly Arg Tyr
85 90 95

His Cys Ser His Phe Ile Gly Val Asn Trp Ser Glu Pro Ser Glu Pro
100 105 110

Leu Asp Leu Leu Val Ala Gly Glu Glu Pro Ala Gly Arg Leu Arg Asp
115 120 125

Arg Pro Ser Leu Ser Val Arg Pro Ser Pro Ser Val Ala Pro Gly Glu
130 135 140

Asn Val Thr Leu Leu Cys Gln Ser Gly Asn Arg Thr Asp Thr Phe Leu
145 150 155 160

Leu Ser Lys Glu Gly Ala Ala His Arg Pro Leu Arg Leu Arg Ser Gln
165 170 175

Asp Gln Asp Gly Trp Tyr Gln Ala Glu Phe Ser Leu Ser Pro Val Thr
180 185 190

Ser Ala His Gly Gly Thr Tyr Arg Cys Tyr Arg Ser Leu Ser Thr Asn
195 200 205

Pro Tyr Leu Leu Ser Gln Pro Ser Glu Pro Leu Ala Leu Leu Val Ala
210 215 220

Asp Tyr Thr Met Gln Asn Leu Ile Arg Met Gly Leu Ala Ala Ser Val
225 230 235 240

Leu Leu Leu Leu Gly Ile Leu Leu Cys Gln Ala Arg His Asp His Gly
245 250 255

Gly Ala Arg Glu Ala Ala Arg Ser
260



(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

GGTCACGAGC CTNTGTCCTG GCCAGNCACC GAGGGNTCAT CCATCCACAG AGCAGTGCAG 60 

TGGGAGGAGC CGNGATGACC CCCATCCTCA AGGTCCTGAT CTGTCTCGGG CCCCTCCCCA 120 

AGCCNACCCT CTGGGCTGAG CCAGGCTCTG TGATCANCNA AGGGGAGTCC TGTANCCCTN 180 

AGGTGT 186 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GGCAGNAGGG NCCCCTCCCC AAGCCCACCC TCTGGGACTG AGCCAGGCTC TGTAATCACC 60 

CAAGGNAGTC CTGTAACCCT NAGGTGTNAG GGGAGCCTGG AGACGCAGGA GTACCATCTA 120 

TATAGAGAAA AGAAAACAGC ACTCTGGATT ACACGGATCC CACAGGAGCT TGTAAAGAAG 180 

GGCCAGTTCC CCATCCTATC CATCACCTGG GAACATGCAG GGCGGTATTC TGTATCTTGG 240 

NAGCCACATT NAAGNCCTNT AGGGGCAGTN 270 
(2) INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 300 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GGTCCTNAAT CTGTNTCGAA GCCTACAGCA AACCCACCCT CTAAGCTCTG CCCAGCCCTG 60 

TGGTGACCTT AGGAGGGAAC GTGAACCCTC CAGTGTGTCT AAAAGGTGGN ATTTGAANGG 120 

CTTCATTCTG TGTAAGGTAA GGAGAAGATG AACACCCACA ACGCCTGANC TCCCATTCCN 180 

ATGCCCGTGG GTGGTCCTGG GCCATCTTCT CCGTGGGCCC CGTGAGCCCG AGTGGCAGTG 240 

GGTCGTACAG GTGCTATGNT TGNTAATTNG GAANTGTGCC CTATGANNTG GGTGTTNTAA 300 



(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 253 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GGCACGAGCC TACAGGTGCT ACGGCTCACT CAGCTCCAAC CCCTACCTGC TGACTCACCC 60
CAGTNACCCC CTGGAGCTCG TGGTCTCAGG AGCAGCTGAG ACCCTCAGCC CACCACAAAA 120
CAAGTCCGAC TCCAAGGCTG GTGAGTGAGG AGATGCTTGC CGTGATGACG CTGGGCACAG 180
AGGGTCAGGT CCTNTCAAGA GGAGCTGGGT GTCCTNGGTG GACATTTNAA GAATTATATT 240
NATTGCANCT TGA 253

(2) INFORMATION FOR SEQ ID NO:16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 197 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TAATTCGGCA GAGCGGGCGG GGACAATAAG GGAATAAAAG CTGGCGAGCN CGGCACCCCA 60
CCAGAGTCTG CTTCCACGTT GTGGNAGCTT TGTTCTCTTG CTCTACACAA TAAATCTTGC 120
TGCTGCTAAA AAAAAAAAAA AAAAAAAAAA AAAAAATTTN GGGGGTCCNA AAAAAAAAAG 180
AAAGGGAAAG GNTTTTT 197

(2) INFORMATION FOR SEQ ID NO:17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 495 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
AATTCGGCAG AGAGTNGTTC GGCCCCCAGC GACCCCCTGA ACATCCTGAT GGCAGGACAG 60

ATCTATGACA CCGTCTCCCT GTCAGCACAG CCGGGCCCCA CAGTGGCCTC AGGAGAGAAC 120

GTGAACCCTG NCTGTGTCAG TNCATGGTGG TCAGTTTGAC ACTTTCCTTC TGACCAAAGA 180 

AGGGGCAGCC CATCCCCCAC TGCGTCTGAG ATCAATGTAC GGAGCTCATA AGTACCAGGC 240 

TGAAATTCCC CATGAGTCCT GTGAACCTCA GCCCACGCGG GGNACCTACA GGTGCTAACG 300 

GNTCACGNAG TTCCAACCCC CACCTGNTGT NTTCACCCCA GTNGGCNCCC TGGGAGCTCG 360 

TTGGTTTTCA GGACAATTTT GGGGGNTTNC ANNTTCCNAN GCCAAAAGGN CNGTCTTTCA 420 

NAANTCGTTT TGGGNAGATA ACTTGGNGGT TTTNAATTGG GGTTTTGGTT GGGCTTTGGG 480 

CNTGTTGTTT TGCCT 495 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 397 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

AATTCGGCAG AGNCAGCACT GAGGGCTCAT CCCTCTGCAG AGCGCGGGGT CACCGGTAAG 60 

GAGACGCCAT GACGCCCGCC CTCACAGCCC TGCTCTGCCT TGGGCTGAGT CTGGGCCCCA 120 

GGAACCCGCG TGGCAGGCAG GGCCCTTCCC CAAACCCACC CTCTGGGGCT GAGCCAGGCT 180 

CTGTGAATCA GCTGGGGGGA GCCCCGTGAA CCATCTGGTG TCAGGGGGAG CCTGGAGGNC 240 

CAGGAGTACC AACTGGATAA AGAAGGGAAG CCCAGAGCCC TTGGGACAGA AATAACCCAC 300 

TGGGAACCCA AGGAACAAGG GCCAGATTTT TGCATNNCCA TTNCATGGAT ACAGNAACCT 360 

TGNNAGGGGA GGATTACCGG TTGGNCAATT ATTAACA 397 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 445 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
AATTCGGCAN AGCCACAAGG TCCGTNTCAA CAGCTGGCCC TGAGGACCAG CCCCTTATGC 60

CTACAGGGTC AGTCCCNCAC AGTGGTCTGA GAAGGCACTG NGAGGTACTG ATCGGGGTCT 120

TGGTGGTCTN CATCCTGCTT CTTTCCCTCC TCCTCTTCCT CCTCCTCCAA CACTGGCGTC 180 

AGGGAAAACA CAGGACATTG GCCCAGAGAC AGGCTGATTT CCAACGTCCT CCAGGGGCTG 240 

CCGAGCCAGA GCCCAAGGAC GGGGGCCTAC AGAGGAGGTN CAGCCCAGTT CTTGACGTTC 300 

CAGGGAGAAA AATTTTTTGN TTNCGTNAAG GACAAAAAAG CTTNGGGACG GGGTTGGAAT 360 

TGNCAATTGG GANNGCCCAN AAGTTTAAGA NCCCCNGGNA TTTANGNTTT NCAAAGTNTA 420 

AAAATTTCAN ACTTTGGGGG GAATT 445 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 260 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

AATTCGGCAC GAGGTACTGG GAGGTACTGA TCGGGGTCTT GGTGGTCTCC ANCCTGCTTC 60 

TNTCCNTCCT CCTCTTCCTC CTCCTCCAAC ACTGGCGTCA GGGAAAACAC AGGACATTGG 120 

CCCAGAGACA GGNTGATTTN CAACGTCCTC CAGGGGCTGC CGAGCCAGAG CCCAAGGACG 180 

GGGGCCTACA GAGGAGGTCC AGCCCAGNTG CTGACGTNCA GGGAGAAAAC TTNTGTGCTN 240 

CCGTGAAGAA CACACAGNCT 260 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 255 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CGGAAAGCCC AGCACCCTTG ANNAGANAGA ACCCACTGGA GCCCAAGAAC AAGGCCAGAT 60
TCTCGATCCC ATCCATGGAC AGAGGACTAT GCAGGGAGAT ACCGCTGTTA ACTATCGCAG 120
CCCTGTAGGC TGGGTGCACA CCNAGGTGAA CCCCCTGNTG NCTGGGAGAT GGTCAGGAGC 180
CTAGAGTAAA CCCACCCTTT AAGGCCCTGC CGAGTCCTCT TGTANCCNCA GTGAAAGAGC 240
GTGACCCTGC TGTNT 255

(2) INFORMATION FOR SEQ ID NO:22:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 497 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
AATTCCGGCC TCTCCTCCCT CCCCACTGTC TGGGGAATTC CTGGACACAA AGGACAGACA 60
GGCAGAAGAG GACAGACAGA TGGACACTGA GGCTGCTGCA TCTAAAGCCC CCCAGGATGT 120
GAACCTACGC CCAGCTGCAC AGTTTACCCT CAGACAGAAG GCAACTGAGC CTCCTCCATC 180
CCAGGAAGGG GCCTCTNCAG CTGAGCCCAG TGTCTATGCC ACTCTGGCCA TCCACTTAAT 240
CCAGGGGGGA CCCAGACCCC ACAAGCCATG GAGATTCAGG ACCNNAGAAG GCATGGAAGN 300
TGCCTTCCAG TAGACATCAN TGAACCCCAG NCAGNCCAGA ACCTTNANAA AGACCATTAG 360
AAGTTTTNGG GGAAGTTTGG GGGTGAATTG NTTTTTGGAA AGGTTAAATA ATTNTNNCTG 420
GNATTTTTNA AATTAAAGTT GGGAGACTTT TTAATTTNNA ATGGGGTTTA TTGNTTNAAA 480
AAANNNGNNN NGTNGNN 497
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 512 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:
AATTCGGCAC GACCTTTCCT CCTCCTCTCC CTCCTCCTCC TCCGTCAGCN TCACAGCAAA 60
CACAGGACAT CTGACCAGAG AAAGACTGAT TTCCAGCGTC CTGCAGGGGC TGCGGAGACA 120
GAGCCCAAGG ACAGGGGCCT GCTGAGGAGG TCCAGCCCAG CTGCTGACGT CCAGGAAGAA 180
AACCTCTGTA AGAGGAAGAG AGGGGACAAA TGGGGGTGCT GGAGAGACAG GAGTCCCAAA 240
ATTTCAGTAG CAACAGGGAG GGGCTGGGAA GGGTCTGGGG CTCCGTGGAA GATGGTCTTN 300
CCCCACACTG TNGGGACCTC CCTGCATTCG GTGGCCCCTT TGGGAGCAGG GCAGGGGGCC 360
AGCAGGATTN AGAGGTTTTA GAGAACCAGG NGANGANCCC TTTGTTTTGN CCCAGNAGTT 420
GTTGTTTTTN AGGGACAAAA ATTTTTNGGN NAGGTTGGAG TTTGNNNATT NAGAGCCCNA 480
AGGTTNNGGN CCCCNGGNAT TNNGTTTTCC CC 512 
(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

GGTCCTNAAT CTGTNTCGAA GCCTACAGCA AACCCACCCT CTAAGCTCTG CCCAGCCCTG 60

TGGTGACCTT AGGAGGGAAC GTGAACCCTC CAGTGTGTCT AAAAGGTGGN ATTTGAANGG 120 

CTTCATTCTG TGTAAGGTAA GGAGAAGATG AACACCCACA ACGCCTGANC TCCCATTCCN 180 

ATGCCCGTGG GTGGTCCTGG GCCATCTTCT CCGTGGGCCC CGTGAGCCCG AGTGGCAGTG 240 

GGTCGTACAG GTGCTATGNT TGNTAATTNG GAANTGTGCC CTATGANNTG GGTGTTNTAA 300 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GACTCATGAC TGAGCCAGGC TCTGTGATCA CCC 33 
(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GACAGATCTC TCACCAGCCT TGGAGTC 27

(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

GACAGATCTG AGCCAGGCTC TGTGATCACC C 31

(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
GACTCTAGAG TCCACCCAGG ACACCCAGC 29
(2) INFORMATION FOR SEQ ID NO:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
CTAGCCAGAT CTGCCACCAT GAACTCCTTC TCCACAAGCG CCTTCGGTCC AGTTGCCTTC 60
TCCCTGGGGC TGCTCCTGGT GTTGCCTGCT GCCTTCCCTG CCCCAGTTGT GAGAGAGCCA 120
GGCTCTGTGA TCACCC 136

(2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
CAGCTCGAGC TCACCAGCCT TGGAGTC 27
(2) INFORMATION FOR SEQ ID NO:31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
CTAGCCGGAT CCGCCACCAT GAACTCCTTC TCCACAAGCG CCTTCGGTCC AGTTGCCTTC 60
TCCCTGGGGC TGCTCCTGGT GTTGCCTGCT GCCTTCCCTG CCCCAGTTGT GAGAGAGCCA 120
GGCTCTGTGA TCACCC 136

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GCTCTAGAGT CCACCCAGGA CACCCAGC 28